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THE APPLICATION OE THE STATISTICAL THEORY OF EXTREME VALUES 

TO GUST-LOAD PROBLEMS 

By Harry Press 


SUMMARY 

An analysis is presented which, indicates that the statistical 
theory of extreme values is applicable to the problems of pre- 
dicting the frequency of encountering the larger gust loads and 
gust velocities for both specific test conditions as well as com- 
mercial transport operations. The extreme-value theory pro- 
vides an analytic form for the distributions of maximum values 
of gust load and velocity. Methods of fitting the distribution 
are given along with a method of estimating the reliability of 
the predictions. 

The theory of extreme values is applied to available load data 
from commercial transport operations. The results indicate 
that the estimates of the frequency of encountering the larger 
loads are more consistent with the data and more reliable than 
those obtained in previous analyses. 

INTRODUCTION 

In the investigation of the loads imposed on airplanes 
due to gusty air, flight measurements of loads and gust 
velocities have been used to obtain estimates of the expected 
logd experience under various operating conditions. For the 
problem of designing airplanes for ultimate static failure, the 
larger gust velocities are of particular interest, as they are 
likely to cause structural damage. Because the larger gust 
velocities are infrequent, measurements available from limited 
samples of data will generally not extend to the larger and 
critical values of gust velocity and load. Consequently, an 
important problem in these investigations is the development 
of techniques for estimating the probabilities of encounter- 
ing these larger values. 

Previous analyses of data obtained from gust-load investiga- 
tions have utilized the statistical approach and considered 
the measurements obtained as random samples for the 
conditions studied. In the analysis of records obtained 
from investigations with V-G recorders (references 1, 2, and 
3), the largest values of the significant variables have been 
selected on some convenient and consistent basis. Frequency 
distributions of the largest values have been represented 
by fitted Pearson type III probability curves. Experience 
with the use of curves of this type has indicated, however, | 
that the estimates of the probabilities of exceeding the larger 
and extrapolated values of the variables cannot be con- 
sidered reliable. In some cases, there has appeared evidence 
that the estimated probabilities of exceeding the larger 
values were too low. A further difficulty, resulting from the 


arbitrary selection of the curve type, has been the inability 
to derive satisfactory methods of measuring the reliability of 
extrapolated predictions. 

Recent developments in the statistical theory of extreme 
values (references 4 to 10) have indicated a somewhat more 
rational approach to the problem of predicting the probability 
of occurrence of the extreme values. In some cases, this new 
approach provides a more satisfactory solution to the 
statistical problem of the determination of the form of the 
distributions of largest values. In addition, methods are 
available for obtaining measures of the reliability of predicted 
values. The present report summarizes some of these 
findings, indicates the method of application, and evaluates 
their applicability to certain gust-load problems. 

SYMBOLS 

number of observation in order from smallest to 
largest 

number of observations from which a ma ximum 
is selected 

total number of observations of maxim um values 
effective gust velocity, feet per second 
true gust velocity, feet per second 
normal-acceleration increment, g units 
airspeed, miles per hour 
design level cruising speed, miles per hour 
airspeed at which maximum acceleration incre- 
ment occurs on V-G record 
random variable 

772 th value from lowest of n values of x 
second largest value of n values of x 
largest value of n values of x 
probability density function of x 
cumulative probability distribution of x defined 

by JL w(x)dx 

TT(z) 

probability density function of maximum values 
of x 

cumulative probability distribution of ma ximum 
values of x 
-W*(x) 

recurrence period of x, average number of observa- 
tions required to equal or exceed given values 
of x 
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y 

y m 
v*(y) 
V*ly ) 

( a V n) m 


8 

o 1-14 


reduced variable defined by y=a(x—u ) 
mth value of n observations, of y 
probability distribution of reduced variable y 
cumulative probability distribution of reduced 
variable y 

reduced standard error of mtli of n observations 
defined by equation (13) 

sample estimate of <r given by 


V : 


Sfo-T ) 2 
n — 1 


S , -,= 
S m 


0.754 

x-D ' ; 3 : - 

standard deviation of mth value of n observations 
0 f * 

\ oi\n } 

statistical parameter of distribution of extreme 
values defined by equation (5) 
statistical parameter of distribution of extreme 


values 


( w(u) \ 

li-nW 


universe mean value of random variable given by 


r 

J— <* 


x w(x)dx 


standard deviation of random variable defined by 
{x~~n) 2 w(x)dx J 

average number of hours per V-G record 


Subscripts: 


LLF design limit load factor 

max maximum value of random variable for specified 

sample 

A bar is used over symbols to indicate the mean value of n 

values of a random variable given by — A tilde ~ is used 

above a statistical parameter to designate the true universe 
value. 


Consideration of recent, work in the theory of extreme 
values has indicated that this theory would appear applicable 
to gust-load problems if the initial distribution of the vari- 
ables satisfied certain requirements. The theory of extreme 
values indicates that for certain initial distributions a limiting 
form exists for the distributions of the maximum values. In 
particular, for initial distributions of the exponential type, 
the limiting form of the distribution of maximum values has 
been found (reference 5) to be a simple analytic function. 
This case appears applicable to certain phases of gust statis- 
tics and will, therefore, be discussed in some detail. 

Consider a random variable x having a probability distri- 
bution w(x). The variable x is assumed to have no upper 
limit; that is, w(x) is greater than zero for all increasing values 
of x. The cumulative probability distribution from below 
is then given by _ ‘ ~ 

Tr(ar)=J* w(x)dx (1) 

and indicates the probability that a measurement is less than 
a given value of x. Similarly, the cumulative probability 
distribution from above is defined by 



and indicates the probability that a measurement is greater 
than the given value of x. The probability that a value x is 
a maximum of N observations may be obtained from the 
product probabilities and is given by 

w(z)t/a-J (3) 


ir*(x) = [Il'(*)]‘ v =[£ w 


A simple analytic expression for ll'*(a:) is of course not 
possible, in general, inasmuch as it depends on the form of 
the initial distribution. If N is very large, asymptotic solu- 
tions are possible for certain forms of the initial distribution 
w(x). In particular, if w(x) is of the exponetial form, it has 
been shown (reference 5) that 


BASIC STATISTICAL CONSIDERATIONS... 

In the application of statistical methods to gust-load 
problems, the maximum values of such gust variables as 
normal acceleration and effective gust velocity obtained 
under given conditions have frequently been selected for 
analysis. The frequency distribution of the maximum 
values obtained from successive" samples has been utilized to 
obtain estimates of the probability of encountering extreme 
values. The problem of obtaining estimates of the probability 
of encountering the extreme values would be considerably 
simplified by the a priori determination of the underlying 
distribution of the maximum values of the variables. The 
knowledge of the underlying distribution should also greatly 
increase the accuracy and reliability of the required estimates. 


W*(x)=e~‘~ aix ~ ,t} 

(4) 

where u is the expected largest value defined by 


•H 

1 

S 

II 

H- 1 

1 

(5) 

and a is defined by 

vj(u) 

“ i-ir(w) 

(6) 

The significance of the parameter a is indicated subsequently. 

The distribution of the maximum values of x may be 
obtained from equation (4) and is given by 

w*(x)=j^ W*(x) = a e- a(x ~ u) -‘~ a{t ~ u) 

00 
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The distribution of maximum values given by equation 
(7; is a two-parameter family of curves with parameters u 
and a. The distribution has positive skewness with a modal 
value less than the mean. 

In order to simplify the foregoing results, the linear trans- 
formation defined by 

y=a(x— U) (8) 

is utilized to transform equation (7) to a simpler form. From 

equations (4), (7), and (8), the following equations are 
obtained: 

V*(y)= W*(x)=e-‘~ 1 ' (9) 

and 

v *(y)=e-’>-‘- v (10) 

The distributions of equations (9) and (10) are shown in 
figures 1 and 2, respectively. It is noted from figure 2 that 
the distribution of equation (10) has one and only one 
maximum at y= 0 and, therefore, the mode of the distribution 
of largest values given by equation (7) must be equal to u. 

For a known distribution, the rate of increase of the maxi- 
mum value of a variable with increasing sample size is 
specified inasmuch as the average number of observations 
required in order to equal or exceed given values of the 
variable may be determined. If a function T(x) is defined 
as the average number of maximum values required in order 
to equal or exceed given values of x, analysis has indicated 



Figure l.— CumulatiTe probability distribution function 



Figcbe 2.— Probability density function * (equation 10). 


that- the value of x exceeded in T(x) observations will increase 
as a linear function of log* Tfr). The results obtained in 
reference 6, page 178, indicate that for large values of x 

log, T{x)~a(x— u) (11) 

or 

logic T{x) *0.4343 a(z-ti) (12) 

For values of x such that 7 T (r)>10, this relationship is a 
sufficiently close approximation for most purposes. This 
result indicates that for large values of x the average number 
of samples necessary to equal or exceed given values of x 
converges toward a simple exponential function of x. From 
equation (11), it is also apparent that the derivative of x 
with respect to the log, T(x) is a constant and is given by 
1 fa. The parameter a, thereby, specifies the rate of in-., 
crease of the maximum value of the variable with increasing 
sample size. 

RELIABILITY OF PREDICTION 

Equation (7) gives the probability distribution function 
of maximum values obtained from samples of a random 
variable with a specified type of initial distribution. A 
sample of maximum values may then be used to obtain 
estimates of parameters of the population of maximum 
values. It would be expected, however, that successive 
samples from the same population will indicate some dif- 
ferences in parameter estimates due to chance, the magnitude 
of the differences depending on the sample size. In practice, 
the parameters of the population are seldom known and 
sample estimates must be used. Methods have been devel- 
oped for measuring the reliability of sample estimates by 
indicating a range within which, for a given probability level, 
the population value can be expected to lie. (See, for 
example, reference 11.) 

A simple and rapid method of indicating the reliability of 
sample estimates of the extreme-value distribution function 
called “control curves” has been derived by Gumbel in ref- 
erences 8 and 9. These control curves provide a measure, 
for a given probability level, of the range within which the 
true population value may be expected. Kimball in ref- 
erence 10 has suggested a more precise, though considerably 


(equation 9). 
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more complex, method of obtaining confidence limits for the 
distribution of extreme values. In the interest of simplicity, 
the present discussion will be restricted to the method given 
by Gumbel. 

The method of obtaining control curves derived by Gumbel 
depends essentially on the determination of control intervals 
at discrete points along the distribution by the utilization 
of the properties of the distribution of the mth , values. 
Within the range 0.1<TT*(r)<0.9, the distribution of 
the mth values of a continuous variable having moderate 
skew distribution and possessing a mode is asymptotically 
normal around the most probable mth value (reference 5) 
with a standard error (<r^n) m given by: 


/ r-s 


-(13) 


The function ( <ryn) m is hereinafter called the “reduced 
standard error” and is shown as a function of F*(x m ) and y 


in figure 3. A horizontal interval around the true mth 
value given by 

where S m is defined by 

(15> 

a\n 

gives a probability of about 0.C8 that the mth value in a 
sample of n observed maximum values will lie in the enclosed 
interval. Similarly, the interval z m ±2S m has a probability 
of about 0.95 of enclosing the mth value from a sample of a 
observed maximum values. 

In order to obtain a control interval at the larger values, a 
fundamental property of the distribution of maximum values 
may be utilized. The most probable largest value of n 
maximum values has been shown (reference 4) to have a 
distribution of the same form as the distribution of the n 
maximum values with the origin shifted to the right by a 



Figure 3.— Reduced standard error for mth values. 
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distance equal to the log, n and with the standard deviation 
unchanged. Consideration of the probability distributions 
■of extreme values indicates that an interval around the most 
probable largest value having a probability of about 0.68 
is given by 

?,-S.<?.<3f,+S, (16) 


where 



(17) 


Similarly, for a probability level of 0.95, the control interval 

2.97 

around the most probable largest value is given by x n ± — — 

Since the control interval around the most probable largest 
value does not depend upon the number of observations n, 
Gumbel Ijas suggested that a control interval of equal width 
may be extended along the extrapolated portion of the curve. 

In order to fill the gap in the control intervals for those 
values not near the median nor the extreme value, the fol- 
lowing equation was obtained by Gumbel in reference 9, 
page 11, for the control interval around the most probable 
penultimate value: 

(18) 


where S',-! is an approximation given by 


S, 




0.754 


(19) 


(3) If either or both control intervals do enclose the value 
obtained from the other sample, the differences cannot be con- 
sidered real and may be attributed to sampling fluctuations. 

Although the foregoing test for significant differences 
cannot be considered rigorous, it would appear to be a 
reasonable test designed in accordance with the levels of 
significance co mm only used in statistical tests. 

DETERMINATION' OF PARAMETERS 

The preceding analysis has presented a basic distribution 
for fitting distributions of maximum values provided that the 
initial distribution is of the simple exponential type. The 
process of fitting the distribution of extreme values to 
observed frequency distributions requires the estimation of 
the parameters u and a from the sample data. Several 
methods are available for the determination of the parameters. 
The accuracy with which the parameters can be estimated 
depends upon the number and accuracy of observations 
available. In practice, a minim um of about twenty-five 
observations has been found generally to be required. 

For large samples of data, n>75, the relations obtained 
for the parameters from the method of moments have 
yielded satisfactory results. The method of moments gives 
the following asymptotic relations for the required param- 
eters (reference 5) : 


c 

u—p — 
a 

(20) 

1 _ \6<r 

CL IT 

(21) 


The control curves defined by equations (18) and (19) have 
a probability level of 0.68. For a probability level of 0.95, 
the numerator of equation (19) becomes 1.73 instead of the 
value of 0.754. 

Inasmuch as the foregoing method allows the determina- 
tion of control intervals around the sample distribution at 
discrete points along the distribution, fairing a smooth curve 
through the points is necessary to obtain a continuous 
control curve. 

Closely related to the control intervals is the problem of 
the significance of observed differences between the distribu- 
tions of samples of data. In the analysis of gust-load data, 
it is frequently required to determine whether the differences 
in probabilities of occurrence of extreme or critical values 
between two samples are significant. The following pro- 
cedure based on the control curves presented herein is sug- 
gested as a test of significant differences: 

(1) Obtain the control intervals with a probability of 0.95 
for the two samples at the value of x at which the comparison 
is to be made. 

(2) If each of the control intervals does not enclose the 
probability value obtained from the other sample, the ob- 
served differences may be considered significant. 


where c is Euler's number and equals 0.5772 and p and c 
are the mean and standard deviation, respectively, of the 
universe distribution of maximum values. If equations (20) 
and (21) are assumed to be true when the parameter values 
are replaced by their respective sample estimates where p 
is estimated by x and a is estimated by s, the following 
equations are obtained: 

_ c 
u=x — 

a 

and 

l _V6s 

a tt 

For samples of data, estimates of the values of v. and a may 
be determined from equations (22) and (23). The values 
of the parameters obtained for u and a are then utilized with 
the transformation equation (8). For given values of x, 
equation (8) gives “equivalent values” of y Rvalues of y 
having the same probability of being exceeded as the given 
value of a:). The probabilities that a value of y and the 
associated value of x will be exceeded may then be obtained 
simply from table I. 


( 22 ) 

(23) 
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For smaller samples, more precise methods of estimating 
the parameters are generally required in order to obtain 
satisfactory representation of the observed cumulative 
frequency distribution. One such method involves., the 
utilization of the transformation equation (8). Observed 
values of x may be transformed to values of y by equating 
the observed points of the cumulative frequency distribution 
with the distribution of F*(y) . The principle of least squares 
may then be applied to arrive at estimates of the required 
parameters in equation (8). The two normal equations 
that are obtained from the principle of least squares are 


n na 


(24) 


2 x V- u 
n n' na 


(25) 


and may be solved simultaneously. Gumbel (reference 9) 
has suggested a simplification of the calculation by obtaining 
from equations (24) and (25) the following equations, in 
terms of the sample estimates: 


x=u+- 

a 

« 2 ( s „) 2 


(26) 

(27) 


For very large samples, y approaches 0.5772, Euler’s number, 
and (s„) 2 approaches ir 2 /6; these values are the same as 
those obtained from the method of moments. 

In the analysis of small samples, the procedure to be fol- 
lowed in determining equivalent values of y for each of the 
observed values of x requires the enumeration of the observa- 
tions in order of size from smallest to largest. For given 

values of x, the ratio of — — ~~~ gives the proportion of 

observations equal to or greater than the given value. This 
ratio, sometimes called the recurrence ratio, gives a measure 
of the probability F*(x) that the given value of x will be 
equaled or exceeded. Table I may be used, simply for the 
conversion of observed recurrence ratios as observed esti- 
mates of F*(y). One observation is lost by this procedure 
because the recurrence ratio for the'smallest observation is 
equal to 1 and an equivalent value of y cannot be determined. 
The loss of the smallest value is of little consequence 
because the principal interest lies with the other end of the 
distribution. 

The values of the parameters obtained by the use of 
equations (26) and (27) can then be used with the trans- 
formation equation (8) to obtain the probability distribution 
in the same manner as previously indicated for the case of 
large samples. 


In order to illustrate the application of the foregoing 
methods to gust-load data, two examples arc selected and 
the methods of calculation indicated for the case of both the 
large and small samples. 

Example 1. — Gust-velocity measurements were available 
for 485 traverses of thunderstorms from the 1946 operations 
of the U. S. Weather Bureau Thunderstorm Project. The 
frequency distribution of the maximum values of gust veloc- 
ity per traverse is shown in table II. The relations obtained 
from the method of moments, equations (22) and (23), are 
used to determine the values of the parameters u and a as 
indicated in the table. By utilizing the values of the param- 
eters, the transformation equation given by 

£7,— 12.8370 
y 4.8263 

is obtained. For given values of U„ transformed or equiva- 
lent values of y are obtained. Table II also indicates the 
equivalent values of y for given values of V, from 2 to 48 
feet per second. The associated probabilities of exceeding 
the indicated values of U e , obtained from table I, are also 
shown. Figure 4 illustrates the fitted extreme-value proba- 
bility distribution along with the cumulative-data points. 

Control intervals for 68-percent anti 95-percent probability 
levels were determined by means of the relations given by 
equations (13) to (19) and the results obtained are shown in 
table III. The control intervals were then used to obtain 
the faired control curves shown in figure 4. 

Example 2 . — Records obtained during recent operations 
of a modern transport airplane were selected for the second 
example. Twenty-six V-G records, each representing roughly 
250 hours of flight operation, were available for a particular 
airline operator and route. The twenty-six maximum 
accelerations obtained from the records are given in table 
IV. The necessary operations in the evaluation of the 
parameters u and a are also indicated in the table. The 
values of the parameters obtained give the following rela- 
tion for the transformation equation: 

An— 0.8674 

. y 0.2874 

The transformation equation and table I were then used to 
obtain the probabilities of exceeding given values of An, 
and the results obtained are shown in the table. The proba- 
bilities of exceeding given values of An along with the 
cumulative data points are shown in figure 5. 

The relations for the control intervals given by equations 
(13) to (19) were used to obtain the 68-perccnt and 95-perccnt 
control, intervals for several values of F*(An) and the 
results obtained are shown in table V. The continuous 
control curves obtained by fairing are shown in figure 5. 
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Figure 5.— Probability that the maximum value of acceleration increment per record will exceed indicated value; V-G data; example 2. 
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APPLICATION AND RESULTS 

The fundamental variables generally considered in gust- 
load investigations include: the effective gust velocity U e , 
the normal-acceleration increment An, the true gust velocity 
U i, and airspeed F. In order to determine whether the 
distribution of extreme values is applicable to distributions 
of maximum values, consideration of the initial distributions 
of the variables is required. The available data on the 
distribution of the aforementioned variables have con- 
sequently been examined. 

Distributions of effective gust velocity obtained under a 
variety of operating conditions Have been reported in reference 
12. At the present time, however, the only extensive dis- 
tributions of gust valuables have been obtained in cloud 
flight from the operations of the U. S. Weather Bureau 
Thunderstorm Project (reference 13) and the NACA air- 
plane investigations of reference 14. The relative frequency 
distributions of U t , An, and U t obtained from these in- 
vestigations are presented in figures 6, 7, and 8, respectively. 

Extreme-value distributions were fitted to distributions of 
maximum values of An available from Y-G records, in order 
to compare the results obtained by the use of the extreme- 
value theory with those obtained in previous analysis of 
V-G data. Thirteen sets of records were available for 
analysis although six of these had only 11 to 19 records each. 
The remaining seven sets had more than 25 records with 



(a) IiM6 Thunderstorm Project data. Total number of observations, 27,4K. 
FtGUBEfl.— Distribution of eSective gust velocity CV. 


one exception, for which only 24 records were available. 
One of these seven sets was discarded because during one 
flight a hurricane was encoimtered which resulted in an 
unusually large acceleration. This unusual occurrence 
appears to preclude a simple statistical treatment of the 
data for this set. 

Extreme-value distributions were fitted to the six sets of 
acceleration increments in accordance with the methods 
previously outlined. The average miles of flight required to 
exceed given values of acceleration increment were determined 
by using the relation 

0.81 1, 

Mileage= — -p — r 

where P is the probability that a given maximum value for a 
record will exceed the given value, r is the average flight 
hours per record, and the airplane is assumed to fly at- an 
average speed of 0.8 V L . 

As a basis of evaluation of the results obtained, the total 
miles of flight represented by each set of data are compared 
with the estimated number of miles (obtained from the 
probability curves) required to exceed the largest value of 
acceleration increment actually measured in each set. 



(b) I9£7 Thunderstorm Project data. Total number of observations, 22,960. 
JFrGcntE 6— Continued. 
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(c) Data from XC-35 afrplana investigations during 1841 and 1842. Total number ol 
observations, 6,032. 

Figure 6— Concluded. 



(a) 1946 Thunderstorm Project data. Total number of observations, 21,385. 
Figure 7.— Distribution of normaS-acoeleration increment An. 



(b) 1847 Thunderstorm Project data. Total number of observations, 17,367. 
FlfttTtK 7— Continued. 


Figure 9 (a) presents the results obtained for the six sets of 
V-G data. The straight line shown in the figure indicates 
equality between predicted and actual mileage. For pur- 
poses of comparison, a similar plot is shown in figure 9 (b) 
for the same samples of data but with the predicted mileage 
values obtained by means of fitting Pearson type 111 prob- 
ability curves to the distributions of maximum acceleration 
increments. 

Figure^ 10 presents a comparison of the average life to 
limit load factor as obtained by using extreme-value and 
Pearson type III distributions for the six sets of data. 
Inasmuch as the estimates are generally extrapolations, a 
measure of the degree of extrapolation, the ratio of the 
maximum acceleration increment to the limit acceleration 
increment, is also given for each sample in the figure. 

In addition to the application of the extreme-value distri- 
bution to distributions of maximum values of An from V-G 
records, some efforts have been made to apply this distribu- 
tion to the observed distributions of maximum values of 
U t and V m ai obtained from V-G records. Although the 
results obtained appear indicative, they do not appear to 
warrant presentation in detail at this time. 
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DISCUSSION 

The foregoing results have indicated that the distribution 
of extreme values given by equation (7) is the limiting form for 
distributions of maximum values where the maximum values 
are selected from large samples from an initial distribution of 
the exponential type. Methods have been presented for the 
fitting of extreme-value distributions. The methods require 
the estimation of only the first two moments of the distribu- 
tion and may be applied simply and rapidly. Methods have 
also been presented which allow the estimation of the refla- 



te) Data tom XC-35 airplane investigations during 1341 and 1912. Total number of 
observations, 4,615. 

Floras 7.— Concluded. 


bility of the predicted probabilities of exceeding the extreme 
values. It would appear, therefore, that, if applicable, the 
use of distribution of extreme values for analysis of gust 
loads would offer significant advantages. 

In order to dete rmin e whether the distribution of extreme 
values is applicable to gust-load variables, the available 
distributions have been examined. Distributions of U, 
are available for a wide variety of test conditions covering 
turbulence within clear air, stratus clouds, cumulus clouds, 
thunderstorm clouds, and areas of radar echo. The avail- 
able data, although admittedly limited, nevertheless indicate 
that for each of the test conditions the distribution of effective 
gust velocity appears of the exponential type with variations 
in average spacing between gusts and variations in the 
standard deviations for the different distributions. Con- 
sideration of the most extensive distributions of U, shown in 
figure 6 indicates that the exponential distributions shown 
by the fitted straight line are good representations of the 
data. Therefore, the distribution of extreme values would 
appear to apply to the effective gust velocity for given test 
conditions. 

The available distributions of An have been obtained, 
largely, under test flight conditions in which the airspeeds 
have been kept constant or at least restricted. The linear 
relation between U, and A n in the sharp-edge gust relation 
would, consequently, be expected to yield the same form for 
the distribution of U, and An. The distributions of An 
shown in figure 7 substantiate the expectation that the 
form of the distributions of An is the same as the distribu- 
tions of U„ of figure 6. It may be concluded that for test 
flights at constant speed at least the distribution of An would 
remain in the exponential form and the distribution of ex- 
treme values would be applicable. 

The available evidence as indicated by figure 8 appears to 
indicate that the distribution of U t also follows a simple 
exponential distribution. The distribution of extreme values 
may, therefore, also be applied to distributions of maximum 
values of U t . 



Mi/es to exceed largest value of An 


REPORT 961 — NATIONAL ADVISORY COMMITTEE FOR AERONAUTICS 




03 / 


So / 


°l / 

06" 

/ 


4 / 


o / 


to « 

Miles flown 


I0 1 10 b 


Miles flown 


(a) Extreme values. (b) Pearson typo III. 

Fioube 9. — Miles flown as a function of miles to exceed largest observed normal-acceleration increment for six samples of V-Q records 
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Life to limit load factor 

FmuRE 10, — Comparison of two methods of predicting life to limit load factor from V-G data by use of one maximum. 
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The question of the applicability of the extreme-value 
distribution to the maximum values of U, obtained from 
Y-G records from commercial operations arises since these 
records generally cover a wide variety of operating and 
weather conditions. Although the available data, in some 
cases, appear to indicate that the distributions of U a are of 
the exponential form, they do not extend to sufficiently high 
values to be conclusive. Available information on commercial- 
transport gust experience and on the frequency distribu- 
tions of U, for various weather conditions indicates that the 
larger values encountered in commercial operations result- 
largely from flight through thunderstorms and convective 
clouds. On the basis, then, that the distribution of gusts in 
convective clouds is exponential, it seems reasonable to 
expect that the distribution of U e in commercial operations 
should have an exponential distribution at least for the larger 
values of gust velocity. Application of the distribution of 
extreme values would seem reasonable, therefore, to the dis- 
tribution of maximum values of U t obtained from Y-G 
records. 

In connection with the possible application of the distribu- 
tion of extreme values to maximum values of acceleration 
increment obtained from Y-G records of commercial trans- 
port operations, the question of the airspeed operating prac- 
tice may be of some importance. Apparently, a systematic 
variation of airspeed as a function of gustiness could con- 
ceivably cause an appreciable departure from the exponential 
form in the distribution of An even at the larger values. 
Little evidence is available, however, to indicate that such 
a systematic variation exists in practice. On the assump- 
tion, then, that operating speeds in areas of moderate to 
severe turbulence are largely confined to a narrow speed 
range for a given airplane or are independent of the intensity 
of the turbulence, the application of the distribution to the 
maximum values of acceleration increment is reasonable. 

Consideration has also been given to the applicability of 
the extreme-value distribution to maximum values of air- 
speed obtained from Y-G records. Little information is 
available, however, concerning the speed-time distribution 
of airspeed. Several attempts were made, consequently, to 
fit observed distributions by using the distribution of ex- 
treme values. The results have indicated that the observed 
distributions of V maz are frequently skewed negatively as 
compared with the positive skew of the distribution of 
extreme values. As a result, the agreement between the 
observed data and the distribution of extreme values was 
poor. The limited information available on the speed-time 
distribution of airspeed has further indicated that the dis- 
tribution of airspeeds follows no simple exponential form 
and depends largely on operational practice. The distribu- 
tion function of equation (7) would, therefore, appear not 
to be applicable to distributions of maximum values of 
airspeed. 

Another variable frequently studied in gust-load investiga- 
tions is the airspeed at which the maximum acceleration on a 
V-G record is encountered. By definition, this variable 
does not represent maximum values of airspeed and, as a 


result, no initial distribution that is directly related to this 
variable is apparent. Available distributions of To have 
been examined, however, and the results indicated that the 
distribution of extreme values of equation (7) is not appli- 
cable to distributions of these data. 

It has been indicated that the distribution of extreme 
values of equation (7) is a rational distribution form for 
purposes of fitting distributions of maximum values of An, 
U„ and U t when the maximums are selected from a large 
number of observations. The need remains for indicating 
that the application of this distribution yields accurate 
predictions. The verification of statistical techniques of 
this tji>e requires huge masses of data. Sufficient data are 
not available to enable a complete answer to this question. 
Available data may be used, however, to compare the 
estimates obtained with those obtained by the methods 
commonly used in the analysis of V-G records. 

The comparison shown in figure 9(a) between the estimated 
miles to exceed the highest observed value of An based on 
extreme-value methods and the actual miles flown indicates 
that the agreement is extremely good. In only one case is 
any appreciable difference noted and in this case the actual 
mileage is about twice the estimated mileage. The same 
comparison between the estimated miles to equal or exceed 
the maximum values obtained by Pearson type III curves 
and the actual miles flown is shown in figure 9 (b) . The figure 
indicates appreciably more scatter with the predicted miles 
always greater than the actual miles with the ratio of the pre- 
dicted to actual miles in two cases about 3 to 1 . This tendency 
toward overestimation of miles to exceed given acceleration- 
increment values is unconservative and may be misleading. 
On this basis, the extreme-value distribution apparently 
yields more reliable estimates of the frequency of encounter- 
ing the larger values. 

Since the miles required to exceed limit load factor is of 
particular concern, the comparison shown in figure 10 of the 
estimates to exceed limit load by the extreme-value and 
Pearson type III curves becomes of interest. Consideration 
of figure 10 indicates that the Pearson curves in every case 
yield larger mileage estimates.. The differences between the 
estimates at limit load factor vary appreciably as indicated 
by the ratio of the mileages which vary from about 1.3 to 
60. Consideration of the ratios of the maximum values of 
An observed to limit load factor for each set of data in- 
dicates that the magnitude of the differences between the 
results of the two methods increases with the degree of extra- 
polation required to reach limit load factor. 

In summary, then, the foregoing comparisons have in- 
dicated that, as compared with previous methods of analysis, 
extreme-value methods yield results more consistent with 
the data at the largest values. The differences between the 
estimates are considerably amplified when comparisons are 
made at extrapolated portions of the distributions. In view 
of this evidence and the rational foundation for the extreme- 
value distribution presented previously, the use of this 
distribution would generally be expected to provide more 
reliable estimates. 



724 


REPORT 991 — NATIONAL ADVISORY COMMITTEE FOR AERONAUTICS 


In the practical application of extreme-value methods to 
the analysis of V-G records, it should be noted that the 
extreme-value distribution of the maximum values of An may 
be used, in conjunction with the methods of reference 1, to 
predict the velocity-acceleration or V-N envelope for a given 
airplane. It has also been suggested that the determination 
of the V-N envelope should be .made by analysis of the 
accelerations by speed bracket. In this connection, the use 
of extreme-value methods would also seem proper provided 
that for each record a large number of accelerations are ex- 
perienced within each of the speed brackets chosen. 

Another problem.eneoimtered, in practice, is that estimates 
of the frequency distribution are frequently required for 
very limited samples, sometimes as few as ten records. In 
order to increase the sample size, the practice has been to 
use both the maximum positive and maximum negative 
acceleration increments from each record; the sample size is 
thereby doubled. Although this practice departs somewhat 
from the extreme-value theory, an analysis of available data 
indicated that the use of two maximums per record for the 
larger samples had only a minor effect on the estimates 
obtained. The use of two maximums per record would 
appear, therefore, to offer a useful basis for increasing sample 
size, particularly for very limited samples. 

CONCLUDING REMARKS 

An analysis of gust-load data by the use of the statistical 
theory of extreme values has indicated the following results: 

1. The theory of extreme values gives a rational form for 

the distribution of maximum values that appears applicable 
to distributions of maximum values of the effective gust 
velocity, true gust velocity, and normal-acceleration incre- 
ment obtained from test flights or commercial transport 
operations. 

2. Simple methods are available for fitting the distribution 
of extreme values to samples of data and for obtaining con- 
trol curves that provide a measure of the reliability of the 
estimates of encountering the larger values. A simple and 
reasonable, but nonrigorous, method of determining whether 
differences between two estimates may be considered sig- 
nificant is also presented. 

3. The application of the distribution of maximum values 
to available V-G data yields estimates of the frequency of 
encountering the larger acceleration increments that are 
consistent with the available data and appear to be more 
reliable than' the estimates obtained in previous analyses. 


4. Notwithstanding the fact that the methods described 
in the present report appear to be applicable, the possibility 
must always be kept in mind that, because of limitations of 
the method, available records, or both, a value may occur 
that far exceeds the limits of the theoretical predictions. 


Langley Aeronautical Laboratory, 

National Advisory Committee for Aeronautics, 
Langley Field, Va., July 22, 1949. 
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TABLE I.— CUMULATIVE PROBABILITY DISTRIBUTION F*(y)=l-e • 1,f 


V 

f *(») 

9 


r 

F *( t ) 

r 

F *( l ) 

- 2.50 
- 2.45 
- 2.40 
- 2.35 
- 2.30 
- 2.25 
— 2.20 
- 2.15 
- 2. 10 
- 2.05 
- 2.00 
- 1.95 
- 1.90 
- 1.85 
— L 50 
- 1.75 
- 1.70 
- L 65 
- 1.60 
- 1.55 
— L 50 
- 1.45 
-L 40 
- 1.35 
- 1. 30 
- 1.25 
- 1.20 
- 1. 15 
- 1. 10 
- 1.05 
- 1.00 
-.95 
-.90 
-.85 
-.80 
-.75 
-.70 
-.65 
-.60 
-.55 
-.50 
-.45 
-.40 
-.35 
-.30 
-.25 
-.20 
-.15 
-.10 
-.05 
0 

.05 

.10 

a 999995 
.999991 
.999984 
.999972 
.999953 
.999924 ' 

. 999 SS 
.99951 
.99972 
.99955 
.99938 
.9991 
.9988 
.9953 
.9976 
.9968 
.9958 
.9945 
.9929 
.9910 
.9857 
.9859 
.9527 
.9789 
.9745 
.9695 
.9638 
.9575 
.9504 
.9426 
.9340 
.9247 
.9145 
.9036 
.8920 
.8796 
. 8665 
.8527 
.8383 
.8233 
.8077 
.7916 
.7750 
.7581 
.7407 
.7231 
.7052 
.6571 
.6689 
.6505 
.6321 
.6137 
.5954 

0.15 
.20 
.25 
.30 
.35 
.40 
.45 
.50 
.55 
.60 
.65 
.70 
.75 
.80 
.85 
.90 
.95 
1.00 
L 05 
L 10 

1.15 
L 20 
L 25 
L 30 

1.35 

1.40 
L 45 
L 50 

1.55 
1.60 
L 65 
L 70 

1.75 
L . 8 Q 
1.85 
L 90 
L 95 
2.00 
2. 05 
2.10 

2.15 
2.20 
2.25 
2.30 

2.35 

2.40 
3.45 
2.50 

2.55 
2.60 
2.65 
2.70 

2.75 

giiBSiiiisiisssiBiiiaaiHasBisispsasssaapiillllBlsll 

2.80 

2.85 
2 . 9 Q 

2.95 

3.00 

3.05 

3.10 

3. 15 

3.20 

3.25 

3.30 

3.35 

3.40 
3-45 
3-50 

3.5 5 

3.60 

3.65 

3.70 

3.75 
3-80 

3. 85 

3.90 

3.95 

4.00 

4.05 

4.10 

4.15 

4.20 

4.25 

4.30 

4.35 

4.40 
4.45 
4.50 

4.55 

4.60 

4.65 

4.70 

4.75 
4.80 

4.85 

4.90 

4.95 
5 uQ 0 

5.05 

5.10 

5.15 

5.20 

5.25 

5.30 

5.35 

5.40 

0.05900 

.05620 

.05353 

.05100 

.04856 

.04625 

.04405 

.04194 

.03994 

.03303 

.03621 

.03447 

.03282 

.03125 

.02975 

.02831 

.02695 

.02565 

.02442 

.02325 

.02212 

.02105 

.02004 

.01907 

.01815 

.01727 

.01643 

.01564 

.01489 

.01416 

.01348 

.01283 

.01220 

.01161 

.01105 

.01051 

. Q 1000 

.009516 

.009054 

.008614 

.008196 

.007798 

.007420 

.007058 

.006716 

.006389 

.006078 

.005752 

.005502 

.005235 

.004950 

.004737 

.004507 

5.45 

5.50 

5.55 

5.60 

5.65 

5.70 
£75 

5.80 

5.85 

5.90 

5.95 
6.00 

6.05 
6.10 

6.15 
6.20 

6.25 

6.30 

6.35 
6.40 

6.45 

6.50 

6.55 

6.60 

6.65 

6.70 

6.75 

6.80 

6.85 

6.90 

6.95 

7.00 

7.05 
7.10 

7.15 
7.20 

7.25 

7.30 

7.35 
7-40 

7.45 

7.50 
7-55 
7.60 

7.65 
7-70 

7.75 
7.80 

7.85 

7.90 

7.95 

8.00 

. . P 


{Table values are accurate to ±1 iu last decimal place. 
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TABLE II . 

SUMMARY OF CALCULATIONS, 1946 U. S. WEATHER 
BUREAU THUNDERSTORM PROJECT DATA 
[Example 1J 


TABLE IV 

SUMMARY OF CALCULATIONS, V-G DATA 

[Example 2] 


I I 

n-m+1 






Derivation of transformation equation 
l7, = 15.6227 


j£f e =6.1899 


Z7.-I2.8370 


An 

({r units) 



Derivation of transformation equation: 
Sn-1.0200 

xAn -0.3202 


y— 0.530# 
1,-1.1141 
it— 0.8074 
. An— 0,8674 


0.2874 


l 

F\V.) 

Control interval 

68 percent 

95 percent 



(4.8263) (1.14)— 5.5020 

(4.8263) (2.97) =14.3341 


(4.8263) (0.75) (485) 

483 3 ' 6347 

(4.8263) (1.73) (485) 

483 8l3S41 

0.10 

(3.16)(4£263) 25 

V485 

(6.32) 08263) _ L * 850 
V485 

0.25 

(2.00) 

Vis! 

(f.OOlOT^ 

vnr 

0.50 

(1.44) (4=8263) ^ 
■y48o 

(2.88)(4£263)_ 083I2 

V4S5 


TABLE V 

CALCULATIONS FOR THE DETERMINATION OF 
THE CONTROL INTERVALS 


£ Example 2; i —0.2874 J 


Control interval 

68 percent 

95 percent 

(0.2874) (1.14) =0,328 

(0.2874) (2.97) -0.854 

(0.2874) (0.75) (26) 

24 ' “0.234 

(0.2874) (1.73) (20) fl ^ 
24 

(3.10) (0.2874) 0 178 

V26 

(6.32) ®.2S74)_ 0 35a 
V20 

(2.00) (0.2874) _ 0 n3 

V26 

(4.00) (0,2874) _ Q 

V20 

(1.44)(0.2874)_ 0 0812 

VS 

(2.88) (0.2874) _ 0162 

V26 








































